AITopics | Memory

Memory

Linear-Memory and Decomposition-Invariant Linearly Convergent Conditional Gradient Algorithm for Structured Polytopes

Neural Information Processing Systems | Mar-17-2026, 11:33:59 GMT

Recently, several works have shown that natural modifications of the classical conditional gradient method (aka the Frank-Wolfe algorithm) for constrained convex optimization provably converge at a linear rate when the feasible set is a polytope and the objective is smooth and strongly convex. However, all of these results suffer from two significant shortcomings: i) a large memory requirement, due to the need to store an explicit convex decomposition of the current iterate, and, as a consequence, a large running-time overhead per iteration; ii) a worst-case convergence rate that depends unfavorably on the dimension. In this work we present a new conditional gradient variant and a corresponding analysis that improves on both of the above shortcomings. In particular, both memory and computation overheads are only linear in the dimension, and, in addition, when the optimal solution is sparse, the new convergence rate replaces a factor which is at least linear in the dimension in previous works with a linear dependence on the number of non-zeros in the optimal solution. At the heart of our method, and corresponding analysis, is a novel way to compute decomposition-invariant away-steps. While our theoretical guarantees do not apply to arbitrary polytopes, they apply to several important structured polytopes that capture central concepts such as paths in graphs, perfect matchings in bipartite graphs, marginal distributions that arise in structured prediction tasks, and more. Our theoretical findings are complemented by empirical evidence that shows that our method delivers state-of-the-art performance.

artificial intelligence, machine learning, proceedings, (10 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.62)
Information Technology > Hardware > Memory (0.59)
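
As background for the abstract above, the following is a minimal sketch of the classical conditional gradient (Frank-Wolfe) method that the paper's variant modifies. It is not the decomposition-invariant away-step algorithm from the paper. The feasible set (the probability simplex), the quadratic objective, the 2/(t+2) step size, and the names `frank_wolfe_simplex`, `target`, and `grad` are all assumptions chosen for illustration.

```python
def frank_wolfe_simplex(grad, x0, num_iters=200):
    """Classical Frank-Wolfe over the probability simplex (illustrative only)."""
    x = list(x0)
    for t in range(num_iters):
        g = grad(x)
        # Linear minimization oracle: over the simplex, the minimizer of
        # <g, v> is the vertex e_i with the smallest gradient coordinate.
        i = min(range(len(x)), key=lambda j: g[j])
        gamma = 2.0 / (t + 2.0)  # standard open-loop step size
        # Convex combination of the iterate and the chosen vertex.
        x = [(1.0 - gamma) * xj for xj in x]
        x[i] += gamma
    return x


# Hypothetical objective: f(x) = 0.5 * ||x - target||^2, so grad f(x) = x - target.
target = [0.7, 0.2, 0.1]
grad = lambda x: [xj - bj for xj, bj in zip(x, target)]
x = frank_wolfe_simplex(grad, [1.0, 0.0, 0.0], num_iters=2000)
print([round(v, 3) for v in x])
```

This baseline keeps only the current iterate in memory. The away-step variants discussed in the abstract are the ones that, in prior work, needed an explicit convex decomposition of the iterate.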

Neural Modulation for Flash Memory: An Unsupervised Learning Framework for Improved Reliability

Neural Information Processing Systems | Feb-17-2026, 10:57:41 GMT

The continued scaling of flash memory technology into smaller process nodes, combined with the increased information capacity of each flash cell (i.e., storing more bits per cell), has placed NAND flash memory at the forefront of modern storage technology.

artificial intelligence, machine learning, modulator, (19 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Israel (0.04)

Technology:

Information Technology > Hardware > Memory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.51)

Amazon just unleashed its Cyber Monday laptop deals and it's dropping prices on MacBooks, gaming PCs, and more

Whether you need a basic everyday driver or a full-featured gaming PC, Amazon's Cyber Monday laptop deals can save you cash. We may earn revenue from the products available on this page and participate in affiliate programs. A laptop is a big investment. Not only do laptops typically cost a lot of money, but you're committing to a machine you'll stare at while you shop, do homework, work remotely, game, and do pretty much everything else in your online life. Amazon just dropped its Cyber Monday deals on laptops and these are some of the lowest prices we have seen all year.

artificial intelligence, laptop, macbook air, (16 more...)

Popular Science

Country: North America > United States > Wisconsin > Milwaukee County > Milwaukee (0.04)

Industry:

Retail > Online (1.00)
Leisure & Entertainment > Games > Computer Games (0.34)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Hardware > Memory (0.33)

Black Friday 2025 could be your last chance for cheap PC deals, experts warn

PCWorld | Nov-26-2025, 20:28:49 GMT

When you purchase through links in our articles, we may earn a small commission. AI is causing a DRAM apocalypse and it's affecting the whole PC market this holiday season. This year, Black Friday tech shoppers should heed one important message: Don't wait, buy now. Because certain components are skyrocketing in price--and it's expected to get even worse. DRAM prices, for example, have doubled in little more than a month. AI hyperscalers have snapped up whatever they can buy.

artificial intelligence, pcpartpicker, security software storage streaming wi-fi, (10 more...)

PCWorld

Country:

Asia > China (0.47)
North America > United States > California (0.04)

Industry:

Information Technology (1.00)
Government > Regional Government (0.94)
Banking & Finance (0.94)
Retail > Online (0.65)

Technology:

Information Technology > Artificial Intelligence (0.67)
Information Technology > Hardware > Memory (0.30)

Zero-Knowledge Proofs in Sublinear Space

Nye, Logan

arXiv.org Artificial Intelligence | Sep-18-2025

Zero-knowledge proofs allow verification of computations without revealing private information. However, existing systems require memory proportional to the computation size, which has historically limited use in large-scale applications and on mobile and edge devices. We solve this fundamental bottleneck by developing, to our knowledge, the first proof system with sublinear memory requirements for mainstream cryptographic constructions. Our approach processes computations in blocks using a space-efficient tree algorithm, reducing memory from linear scaling to square-root scaling--from $\Theta(T)$ to $O(\sqrt{T} + \log T \log\log T)$ for computation size $T$--while maintaining the same proof generation time through a constant number of streaming passes. For widely used linear polynomial commitment schemes (KZG/IPA), our method produces identical proofs and verification when using the same parameters and hashing only aggregate commitments into the challenge generation, preserving proof size and security. Hash-based systems also achieve square-root memory scaling, though with slightly different proof structures. This advance enables zero-knowledge proofs on everyday devices and makes previously infeasible large computations verifiable, fundamentally democratizing access to privacy-preserving computation. Space-efficient zero-knowledge proof systems create opportunities to reshape how trust is established in digital systems--from enabling widespread participation in decentralized networks to making verifiable scientific computing practical at unprecedented scales.

artificial intelligence, blk, computation, (15 more...)

arXiv.org Artificial Intelligence

2509.05326

Country: North America > United States (0.28)

Genre: Research Report (0.40)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence (0.46)
Information Technology > Hardware > Memory (0.34)
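
To give a feel for the kind of space-efficient tree processing the zero-knowledge abstract above alludes to, here is a minimal sketch that computes a Merkle root over a stream of leaves while holding only a small stack of partial subtree digests. It is a generic textbook-style illustration, not the paper's block-based algorithm and not a proof system. The function names, the use of SHA-256, and the power-of-two restriction on the number of leaves are assumptions made for the example.

```python
import hashlib


def h(data: bytes) -> bytes:
    """Hash helper (SHA-256 is an assumption made for this sketch)."""
    return hashlib.sha256(data).digest()


def streaming_merkle_root(leaves):
    """Merkle root of a stream of 2^k leaves using O(log T) stored digests."""
    stack = []  # entries are (height, digest) of completed subtrees
    for leaf in leaves:
        node = (0, h(leaf))
        # Merge equal-height subtrees as soon as both halves are available.
        while stack and stack[-1][0] == node[0]:
            height, left = stack.pop()
            node = (height + 1, h(left + node[1]))
        stack.append(node)
    if len(stack) != 1:
        raise ValueError("number of leaves must be a power of two")
    return stack[0][1]


def naive_merkle_root(leaves):
    """Reference version that materializes every level (linear memory)."""
    level = [h(x) for x in leaves]
    while len(level) > 1:
        level = [h(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]


data = [bytes([i]) for i in range(8)]
# The streaming version accepts a generator, so the leaves need not be in memory.
assert streaming_merkle_root(iter(data)) == naive_merkle_root(data)
```

The stack never holds two subtrees of the same height, so at most one digest per tree level is stored at any time, in contrast to the reference version, which keeps a full level in memory.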

Memory-Efficient FastText: A Comprehensive Approach Using Double-Array Trie Structures and Mark-Compact Memory Management

Du, Yimin

arXiv.org Artificial Intelligence | Jun-3-2025

FastText has established itself as a fundamental algorithm for learning word representations, demonstrating exceptional capability in handling out-of-vocabulary words through character-level n-gram embeddings. However, its hash-based bucketing mechanism introduces critical limitations for large-scale industrial deployment: hash collisions cause semantic drift, and memory requirements become prohibitively expensive when dealing with real-world vocabularies containing millions of terms. This paper presents a comprehensive memory optimization framework that fundamentally reimagines FastText's memory management through the integration of double-array trie (DA-trie) structures and mark-compact garbage collection principles. Our approach leverages the linguistic insight that n-grams sharing common prefixes or suffixes exhibit highly correlated embeddings due to co-occurrence patterns in natural language. By systematically identifying and merging semantically similar embeddings based on structural relationships, we achieve compression ratios of 4:1 to 10:1 while maintaining near-perfect embedding quality. The algorithm consists of four sophisticated phases: prefix trie construction with embedding mapping, prefix-based similarity compression, suffix-based similarity compression, and mark-compact memory reorganization. Comprehensive experiments on a 30-million Chinese vocabulary dataset demonstrate memory reduction from over 100GB to approximately 30GB with negligible performance degradation. Our industrial deployment results show significant cost reduction, faster loading times, and improved model reliability through the elimination of hash collision artifacts. Code and experimental implementations are available at: https://github.com/initial-d/me_fasttext

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2506.01254

Genre: Research Report > New Finding (0.48)

Industry: Water & Waste Management > Solid Waste Management (0.56)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Hardware > Memory (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
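
The prefix-based similarity compression phase named in the FastText abstract above can be sketched roughly as follows. This is a simplified illustration only: it groups n-grams with a plain dictionary rather than a double-array trie, omits the suffix pass and the mark-compact step, and the function names, the `prefix_len` and `threshold` parameters, and the toy vectors are all hypothetical rather than taken from the paper or its repository.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def merge_by_prefix(embeddings, prefix_len=2, threshold=0.95):
    """Share one stored vector among similar n-grams with a common prefix."""
    groups = defaultdict(list)
    for ngram in sorted(embeddings):
        groups[ngram[:prefix_len]].append(ngram)

    table = []  # deduplicated vectors that are actually stored
    index = {}  # n-gram -> row in `table`
    for members in groups.values():
        rows = []  # rows created for this prefix group
        for ngram in members:
            vec = embeddings[ngram]
            for row in rows:
                if cosine(table[row], vec) >= threshold:
                    index[ngram] = row  # reuse an existing, similar vector
                    break
            else:
                table.append(vec)
                rows.append(len(table) - 1)
                index[ngram] = len(table) - 1
    return table, index


toy = {"abc": [1.0, 0.0], "abd": [0.99, 0.05], "abe": [0.0, 1.0], "xyz": [1.0, 0.0]}
table, index = merge_by_prefix(toy)
print(len(toy), "n-grams stored in", len(table), "vectors")
```

In this toy input, "abc" and "abd" share a prefix and point in nearly the same direction, so they end up sharing one row, while "xyz" keeps its own row because merging is only attempted within a prefix group.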

Bayesian Reasoning Enabled by Spin-Orbit Torque Magnetic Tunnel Junctions

Xu, Yingqian, Li, Xiaohan, Wan, Caihua, Zhang, Ran, He, Bin, Liu, Shiqiang, Xia, Jihao, Kong, Dehao, Xiong, Shilong, Yu, Guoqiang, Han, Xiufeng

arXiv.org Artificial Intelligence | Apr-14-2025

The rapid development of artificial intelligence (AI) over the past few decades has been nourished by advancements in machine learning algorithms, increased computational power, and the availability of vast amounts of data[1], which has in turn revolutionized numerous fields including but not limited to medical science and healthcare, information technologies, finance, and transportation. This regenerative feedback between AI and its applications leads to further explosive growth of data and expansion of model scales, which calls for a paradigm shift toward efficient and speedy computing and memory technologies, especially advanced algorithms and emerging AI hardware enabled by nonvolatile memories[2]. In this respect, emerging memory technologies, such as magnetic random-access memories[3], ferroelectric random-access memories[4], resistive random-access memories[5, 6] and phase-change random-access memories[7], have been implemented to accelerate AI computing, for instance, matrix multiplication[8]. Thanks to their high energy efficiency, fast speed, long endurance, and versatile functionalities, spintronic devices based on spin-orbit torques, one prominent example among emerging memories, have shown great potential for hardware-accelerated true random number generation (TRNG)[9-18] in addition to matrix multiplication. For instance, high-quality true random number generators with stable and reconfigurable probability tunability have been demonstrated using SOT-MTJs[19-21].

artificial intelligence, bayesian network, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2504.08257

Country: Asia > China (0.15)

Genre: Research Report (1.00)

Industry:

Health & Medicine (1.00)
Semiconductors & Electronics (0.95)

Technology:

Information Technology > Hardware > Memory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

AMD Radeon RX 9070 and 9070 XT review: The new 1440p gaming champions

PCWorld | Mar-5-2025, 14:00:13 GMT

Some software bugs mar the experience but overall, AMD's 9070 graphics cards offer such a compelling mix of performance, value, and memory capacity that it's worth accepting those quibbles. Nvidia fumbled the ball with its $549 GeForce RTX 5070, and AMD's new Radeon RX 9070 and 9070 XT are primed to seize the advantage. The RTX 5070, hitting store shelves today, is a good 1440p graphics card but a stagnant generational sidegrade at best. Enter the $549 Radeon RX 9070 and $599 Radeon RX 9070 XT, launching tomorrow. Both cards are faster than the RTX 5070, with the 9070 XT going toe-to-toe with the $750 RTX 5070 Ti in many games, and each includes an ample 16GB of VRAM.

artificial intelligence, radeon rx 9070, rtx 5070, (17 more...)

PCWorld

Country:

North America > United States > Indiana (0.04)
Asia > Vietnam > Long An Province (0.04)

Genre:

Research Report (0.40)
Overview (0.40)

Industry:

Leisure & Entertainment > Sports (0.35)
Leisure & Entertainment > Games (0.35)

Technology:

Information Technology > Artificial Intelligence (0.69)
Information Technology > Hardware > Memory (0.30)

Reviews: Large Memory Layers with Product Keys

Neural Information Processing Systems | Jan-26-2025, 01:36:58 GMT

UPDATE: The authors answered my questions; I would like to keep my score unchanged and suggest focusing on the clarity of the final version. Perhaps this is a case where I would really be interested in looking at the source code. Originality: the paper borrows the general idea of product keys from the database community; however, the application to fast retrieval in neural memory systems seems quite novel to me. Quality: The core ideas of the paper are sound; however, I would appreciate more rigor in both the conceptual and experimental comparison with other approaches incorporating memory into the Transformer (see e.g. Another suggestion would be to discuss further the issue of potential non-uniformity of the query distribution, which indeed seems to be quite relevant.

memory layer, product key, transformer, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Hardware > Memory (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.39)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.36)

Managed-Retention Memory: A New Class of Memory for the AI Era

Legtchenko, Sergey, Stefanovici, Ioan, Black, Richard, Rowstron, Antony, Liu, Junyi, Costa, Paolo, Canakci, Burcu, Narayanan, Dushyanth, Wu, Xingbo

arXiv.org Artificial Intelligence | Jan-16-2025

AI clusters today are one of the major uses of High Bandwidth Memory (HBM). However, HBM is suboptimal for AI workloads for several reasons. Analysis shows HBM is overprovisioned on write performance, but underprovisioned on density and read bandwidth, and also has significant energy per bit overheads. It is also expensive, with lower yield than DRAM due to manufacturing complexity. We propose a new memory class: Managed-Retention Memory (MRM), which is more optimized to store key data structures for AI inference workloads. We believe that MRM may finally provide a path to viability for technologies that were originally proposed to support Storage Class Memory (SCM). These technologies traditionally offered long-term persistence (10+ years) but provided poor IO performance and/or endurance. MRM makes different trade-offs, and by understanding the workload IO patterns, MRM foregoes long-term data retention and write performance for better potential performance on the metrics important for these workloads.

managed-retention memory, requirement, workload, (15 more...)

arXiv.org Artificial Intelligence

2501.09605

Country:

North America > United States > New York > New York County > New York City (0.04)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(5 more...)

Genre: Research Report (0.40)

Industry:

Information Technology (0.47)
Semiconductors & Electronics (0.46)

Technology:

Information Technology > Hardware > Memory (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
